Construction and properties of assortative random networks 
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Many social networks exhibit assortative mixing so that the predictions of uncorrected mod- 
els might be inadequate. To analyze the role of assortativity we introduce an algorithm which 
changes correlations in a network and produces assortative mixing to a desired degree. This degree 
is governed by one parameter p. Changing this parameter one can construct networks ranging from 
fully random (p = 0) to totally assortative (p = 1). We apply the algorithm to a Barabasi- Albert 
scale-free network and show that the degree of assortativity is an important parameter governing 
geometrical and transport properties of networks. Thus, the diameter of the network and the clus- 
tering coefficient increase dramatically with the degree of assortativity. Moreover, the concentration 
dependences of the size of the giant component in the node percolation problem for uncorrelated 
and assortative networks are strongly different. 

PACS numbers: 05.50.+q, 89.75.Hc 



INTRODUCTION 



Complex networks have recently attracted a burst of 
interest as an indispensable tool of description of differ- 
ent complex systems. Thus, technological webs as the 
Internet and the World Wide Web, as well as other nat- 
ural and social systems like intricate chemical reactions 
in the living cell, the networks of scientific and movie 
actors' collaborations, and even human sexual contacts 
have been successfully described through scale-free net- 
works, networks with the degree distribution P(k) ~ k 1 
[1,2]. The degree distribution P(k) is one of the essen- 
tial measures used to capture the structure of a network, 
and gives the probability that a node chosen at random 
is connected with exactly k other vertices of the network. 

Recently, it was pointed out that the existence of de- 
gree correlations among nodes is an important property 
of the real networks [3-15]. Thus, many social networks 
show that nodes having many connections tend to be 
connect with other highly connected nodes [4,6]. In the 
literature this characteristics is usually dented as assor- 
tativity, or assortative mixing. On the other hand, tech- 
nological and biological networks often have the prop- 
erty that nodes with high degree are preferably connected 
with ones with low degree, a property referred to as dis- 
sortativity [3,7]. Such correlations have an important in- 
fluence on the topology of networks, and therefore they 
are essential for the description of spreading phenomena, 
like spreading of information or infections, as well as for 
the robustness of networks against intentional attack or 
random breakdown of their elements [16-21]. 

In order to assess the role of correlations, especially of 
the assortative mixing, several studies have proposed pro- 
cedures to build correlated networks [3,23-25]. The most 
general of them are the ones proposed by Newman [3], 
and by Boguna and Pastor-Satorras [25] , who suggest two 
different ways to construct general correlated networks 
with prescribed correlations. Following the same goal, we 



however adopt a different perspective in this paper. We 
propose a simple algorithm producing assortative mix- 
ing, in which, instead of putting correlations by hand, 
we only try impose the intuitive condition that "nodes 
with similar degree connects preferably" . We then in- 
vestigate the correlations which come out of our simple 
model. Thus, we present an algorithm, governed by the 
only parameter p, capable to generate assortative corre- 
lations to a desired degree. In order to study the effect 
of the assortative mixing, we apply our algorithm to a 
Barabasi- Albert scale- free network [26], the one leading 
to the degree distribution P(k) ~ fc -3 , and investigate 
the properties of the emerging networks in some detail. 

THE ALGORITHM 

In what follows we treat undirected networks. Start- 
ing from a given network, at each step two links of the 
network are chosen at random, so that the four nodes, 
in general, with different degrees, connected through the 
links two by two are considered. The step of our algo- 
rithm looks as follows. The four nodes are ordered with 
respect to their degrees. Then, with probability p, the 
links are rewired in a such a way that one link connects 
the two nodes with the smaller degrees and the other con- 
nects the two nodes with the larger degrees, otherwise the 
links are randomly rewired (Maslov-Sneppen algorithm 
[11]). In the case when one, or both, of these new links 
already existed in the network, the step is discarded and 
a new pair of edges is selected. This restriction prevents 
the appearance of multiple edges connecting the same 
pair of nodes. A repeated application of the rewiring 
step leads to an assortative version of the original net- 
work. Note that the algorithm does not change the de- 
gree of the nodes involved and thus the overall degree 
distribution in the network. Changing the parameter p, 
it is possible to construct networks with different degree 
of assortativity. 
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CORRELATIONS AND ASSORTATIVITY 



Let £ij be the probability that a randomly selected 
edge of the network connects two nodes, one with degree 
i and another with degree j. The probabilities £y de- 
termine the correlations of the network. We say that a 
network is uncorrelated when 



Sij (2 Sij ) 



iP(i) jPjj) 
(i> (j) 



(1) 



i.e, when the probability that a link is connected to a 
node with a certain degree is independent from the de- 
gree of the attached node. Here (i) — (j) denotes the 
first moment of the degree distribution. 

Assortativity means that highly connected nodes tend 
to be connected to each other with a higher probability 
than in an uncorrelated network. Moreover, the nodes 
with similar degrees tend to be connected with larger 
probability than in the uncorrelated case, i. e., Su > £J 
Vi. The degree of assortativity of a network can thus be 
characterized by the quantity [3] : 



A = 



^2i £ii ^2i £j; 



(2) 



which takes the value when the network is uncorrelated 
and the value 1 when the network is totally assortative. 
(Note that finite-size effects and the constraint that no 
vertices are connected by more than one edge bound A 
from above by the values lower than 1 [22]). 

Now, starting from the algorithm generator, we can 
obtain a theoretical expression for Sij as a function of p. 
Let Eij be the number of links in the network connect- 
ing two nodes, one with degree i and another with degree 
j, so that Sij = Eij/L, where L is the total number of 
links of the network. (Since undirected networks satisfy 
Eij = Eji, the restriction i < j can be imposed without 
loss of generality). We now define the variable 



F h 



£5> 



r < s 



I < n 



(3) 
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A careful analysis of the algorithm reveals that, every 
time the rewiring process is applied, Fi n either does not 
change, or changes increasing or decreasing by unity. We 
can then calculated the probabilities that it changes, i. 
e., that Fi n — * Fi n +1 or Fi„ — * Fi n — 1. Here, the effect of 
multiple edges can be disregarded since they are rare in 
the thermodynamical limit of infinite networks. Taking 
into account all corresponding possibilities, we obtain for 
the probabilities of changes the following expressions: 



{Xln - fin) 2 + P {Xl n - f\ n + /l,2-l) 2 



for Ft 



Fi, 



1 and 



f ln [(1 - P )(l - 2X ln ) +p(X 1 ,_ 1 - /!,,_! - f ln ) + f lr , 



for Fi n — * Fi n - 1. Here /;„ = Fi n /L, and X tn is given 
by: 

1 - 

X in = — VfcP(fc) l<n . 

{k) k=i 

(Note that Xi n and /;„, vanish when one of the indices 
is smaller than 1, the minimal tolerated degree). Using 
this, we can calculate the expected value for The 
process of repeated applying our algorithm corresponds 
to an ergodic Markov chain, and the stationary solution 
in the thermodynamical limit is given by the condition: 

(Xln - fln) 2 +P {Xln - fln + fl,l-lf = (4) 
fln [(1 - P )(l - 2X ln ) + P (X U _ 1 - /!,!_! - f ln ) + fln] 

for all I > 1. For I = 1 this condition reduces to 

(1 +p) (X ln - fin) 2 = (1 ~p)hn [1 - 2X ln + f ln ] . (5) 

Using Eq. (4) and Eq. (5) we can calculate //„. The 
solutions reads 



fln 



X ln + ( B n - B n -l) 



(l-p)/2+ P Xi n + Bn+B n -l 



I < 



with 



pXin 



1-p 



PX 2 ln 



1+P 



Applying the definition, Eq.(3), we obtain the correla- 
tions: 



S t j — f,.j — f%,j-\ — fi+i,j + fi+i,j-i- 



(6) 
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FIG. 1. The lower curve corresponds to the measured as- 
sortativity A of our simulations, whereas the upper curve cor- 
responds to the theory. We note that both curves coincide for 
A < 0.7. Above this value the finite-size corrections get im- 
portant, leading to the measured value of „4 < 1 for p — » 1. 
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Finally, note that Eq.(6) reduces to the corresponding 
uncorrelated case when p — 0, and reduces to 



for the case p = 1 . 



SIMULATION RESULTS 

In our simulations we apply our algorithm to the 
Barabasi- Albert network [26] with N = 10 5 nodes and 
L = 2 ■ 10 5 links. We measure £y as functions of p, 
and use them to calculate the corresponding values of A. 
All simulation results are averaged over ten independent 
realizations of the algorithm as applied to the original 
network. 

Fig. 1 shows the relation between the parameter p 
and the coefficient of assortativity A. The lower curve 
corresponds to the measured assortativity, and the up- 
per to our theoretical prediction. Both curves coincide 
for A < 0.7. However, whereas the theoretical curve 
reach the value 1 for p — > 1, the measured assortativ- 
ity increases until the maximal value smaller than one 
(.A — > 0.913) is reached. This was theoretically expected, 
and is due to the finite-size effects mentioned above. 
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FIG. 2. £kk as a function of k for different values of A. 
From bottom to top: A = 0, 0.221, 0.443, and 0.640. The 
points are the results of the simulations and the curves corre- 
spond to the theory. 

To assess the goodness of the Eq. (6) we compare in 
the Fig. 2 the theoretical values of £kk, given by Eq. (6), 
with the simulations. The points correspond to the sim- 
ulations and the curves are the corresponding theoretical 
results obtained based on the actual degree distribution 
of a particular realization of the network discussed. We 
note that the agreement is really excellent. 



Diameter. — The diameter of a network is the average 
distance between every pair of vertices of the network, 
being defined as the number of edges along the shortest 
path connecting them. Uncorrelated scale-free networks 
show a very small diameter, typically growing as the log- 
arithm of the network's size. For networks with N ~ 10 5 
the diameter is about d ~ 6. The results of the simu- 
lations show that the diameter grows rapidly when the 
assortativity of the network increases (fig. 3), so that 
it becomes hundred times larger than for the uncorre- 
lated network when the coefficient of assortativity tend 
to its maximal value. In the inset we plot the diameter 
as a function of /C — A, where K, = 0.913 corresponds 
to this maximal value of A attainable in the network. 
For our particular Barabasi- Albert network we thus have 
d cx (0.913 - Ay 1 - 12 . 
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FIG. 3. Diameter of the network versus coefficient of as- 
sortativity. We note that the diameter grows rapidly when 
A increases. In the inset the diameter is plotted on double 
logarithmic scales as function of K, — A, being K, = 0.931. The 
slope of the straight line is 1.12. 

Clustering coefficient. — Clustering coefficients of a 
network are a measure of the number of loops (closed 
paths) of length three. The notion has its roots in soci- 
ology, where it was important to analyze the groups of 
acquaintances in which every member knows every other 
one. To discuss the concept of clustering, let us focus first 
on a vertex, having k edges connected to k other nodes 
termed as nearest neighbors. If these nearest neighbors 
of the selected node were forming a fully connected clus- 
ter of vertices, there would be k(k — l)/2 edges between 
them. The ratio between the number of edges that really 
exist between these k vertices and the maximal number 
k(k — l)/2 gives the value of the clustering coefficient of 
the selected node. The clustering coefficient of the whole 
network C is then defined as the average of the cluster- 
ing coefficients of all vertices. One can also speak about 
the clustering coefficient of nodes with a given degree k, 
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referring to the average of the clustering coefficients only 
over this type of nodes. We shall denote this degree- 
dependent clustering coefficient by C(k), to distinguish 
it from C. 




FIG. 4. C(k) as a function of the degree of nodes k. Differ- 
ent curves correspond to different values of A. From bottom 
to top A = 0, 0.069, 0.221, 0.443, 0.640, 0.777, 0.856, and 1. 
Inset: clustering coefficient C versus the degree of assortativ- 
ity A. 

Fig. 4 shows the variation of both clustering coeffi- 
cients with the assortativity of the network. The cluster- 
ing coefficient C increases with the assortativity (inset of 
the figure). The variation of C(k) shows more interest- 
ing features. The simulations show a peak around k = 90 
(probably a finite size effect) whose height increases with 
the assortativity of the network. In the uncorrelated case 
C(k) does not depend on k [13], but a strong tendency to 
clustering (for relatively large k) emerges when A grows. 
We also observe in our simulations that C(k = 2) = 
when A ~ 1 (k = 2 correspond to the minimal degree of 
our vertices) . This is not surprising since in a strongly as- 
sortative case almost all nodes with degree k = 2 are con- 
nected between themselves, forming one or several large 
loops of length larger than three. This means that all 
nodes having this minimal degree (in our simulations the 
half of the total number of vertices) do not tend to con- 
tribute to the clustering coefficient C. 

In the present contribution we concentrate on the in- 
vestigation of the properties of the proposed algorithm. 
However, we suggest, in relation to real networks, a sim- 
ple modification of the algorithm, that perhaps could 
be useful. Thus, in order to generate assortativity only 
among highly connected vertices, one can apply the algo- 
rithm above only when at least one of the four nodes se- 
lected at the corresponding step has a degree larger than 
some chosen k. Provided all four nodes have a smaller 
degree, the the Maslov-Sneppen step is used. This pro- 
cedure could lead to a larger value for the clustering co- 



efficient, as it is observed in real networks (C > 0.1) 
[1]. The last ones might, however, have a much more 
intricate structure, partly governed by the metrics of the 
underlying space, as in the models discussed in [27], so 
that caution has to be exercised when applying results of 
theoretical models disregarding metrical relations to real 
networks. 
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FIG. 5. Fraction of nodes M in the giant component de- 
pending on the fraction of nodes removed from the network. 
The graph compares the results for different degrees of assor- 
tativity. From top to bottom: A = 0, 0.069, 0.221, 0.443, 
0.640, 0.777, 0.856, and 1. 



Node percolation. — Node percolation corresponds to 
removal of a certain fraction of vertices from the net- 
work, and is relevant when discussing their vulnerability 
to a random attack. Let q be the fraction of the nodes 
removed. At a critical fraction q c , the giant component 
(largest connected cluster) breaks into tiny isolated clus- 
ters. Fig. 5 shows the fraction of nodes M. in the giant 
component as a function of q for different degrees of as- 
sortativity of the network. We note that the behavior of 
Ai(q) changes gradually with A from the uncorrelated 
case (upper curve) to a quite different behavior when 
A — ► 1 (lower curve), which indicates a very different 
topology in the network when it is strongly assortative. 
However, although the particular form of the M. depen- 
dence is different for different degrees of assortativity, the 
absence of the transition at finite concentrations (q c = 1) 
and the overall type of the critical behavior for correlated 
networks with the same P(k) seems to be the same as for 
uncorrelated networks, namely the one discussed in Refs. 
[28,29]. We also point out that in case A c± 1, a finite 
network is no longer fully connected: part of the nodes 
does not belong to the giant component even for q = 0. 
The results suggest that, in the thermodynamical limit, 
the giant cluster at q — contains around a half of all 
nodes, and that its density then decays smoothly with q. 
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CONCLUSIONS 

In summary, we present an algorithm to generate as- 
sortatively correlated networks. In the termodynamical 
limit we obtain a theoretical expression for the generated 
correlations, which only depends on the degree distribu- 
tion of the network and on the turnable parameter p of 
the algorithm. Finally, we show that assortative correla- 
tions have a drastic influence on the statistical properties 
of networks, changing strikingly their diameter and clus- 
tering coefficient, as well as their percolation properties. 

We also indicate that with a minor change in our algo- 
rithm one can produces dissortativc mixing too. The only 
change would be the following: after ordering the nodes 
with respect to their degree, one rewires, with probability 
p, the edges so that one link connects the highest con- 
nected node with the node with the lowest degree and 
the other link connects the two remaining vertices; with 
probability 1 — p one rewires the links randomly. 
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